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Abstract 

r^ , A model based on a thermodynamic approach is proposed for predicting the dynamics 

Oh of communicable epidemics in a city, when the epidemic is governed by controlling efforts 

of multiple scales so that an entropy is associated with the system. All the epidemic details 
are factored into a single parameter that is determined by maximizing the rate of entropy 
production. Despite the simplicity of the final model, it predicts the number of hospital- 
ized cases with a reasonable accuracy, using the data of SARS of the year 2003, once the 
inflexion point characterizing the effect of multiple controlling efforts is known. This model 
is supposed to be of potential usefulness since epidemics such as avian influenza like H7H9 
in China this year have the risk to become communicable among human beings. 

<**> f Keywords. Epidemics, entropy, inflexion point 

1 Introduction 

en 

Starting from November 2002 till the end of May 2003, the severe acute respiratory syndrome 
VO [ (SARS) had spread widely over the world. Up to the end of May 2003 probable cases have been 

reported in 35 countries or regions, and the cumulative number of cases has reached 8202 by 
T^f-" \ May 26, 2003 according to the report by the World Health Organization (WHO). SARS in the 

year 2003 and avian influenza like H7N9 this year received or receive intensive attentions from 
all over the world due to its high case-fatality rate. People were particularly interested in finding 
the period of the time between infection and the onset of infectiousness, length of period that 
patients remain infectious, further infections that each patient produce, total number of infections 
during the epidemic, etc. A large number of publications have been reported for SARS, many of 
which have been included in reference books [TJ [2] and reviews [3J |3] ■ Important achievements 
have been made for the transmission dynamics using various mathematical models [5]-[T7] and 
reported data from Hong Kong or Canada. Donnelly et al[5], Riley et al [B] and Lipsitch et al 
[7] make use of the available data for SARS on latent, incubation and infectious periods and 
have successfully fitted their models to data describing the number of cases observed over time. 
The important conclusion is that if the SARS is uncontrolled, then a majority of people would 
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be infected. The potential effectiveness of different control measures has been studied in these 
references. 

Though SARS did not appear again since 2003, there may be other epidemic, such as H7N9 
avian influenza occurring actually in China possibly, spreading in a similar way. Hence the 
study of various models for the prediction of SARS and other epidemic once it occurs is always 
important. 

As assessed by Dye and Gay [8] , the current mathematical models are complex, the data are 
poor, and some big questions such as accuracy of case reports and heterogeneity in transmission 
remain. Dye and Gay anticipated that the next generation of SARS models would have to 
become more complex. 

It is now evident that SARS and maybe avian influenza in a city can be controlled through 
multiscale measures such as medical interventions, public-service announcements, isolation of 
people having contact with infected, restriction of individual and social activities, etc. When 
the interventions to control a communicable epidemic are intensive and of multiple scales, it would 
be very difficult to find all those details of the epidemic needed by a more complex model. It is 
thus desired that, under intensive and multiscale interventions, the global behavior of SARS or 
avian influenza spread, governed by a complex and multiscale system, could be roughly predicted 
without knowing the epidemic details. 

The dynamics of an epidemics is an important topic in biology, medicine, mathematics and 
physics and is usually modelled through differential equations [H]-[22], among which is the 
famous SIR (susceptible-infected-removed) model. The study on this topic is still very active 

(i-inram)- 

Most of the models for epidemics spread rely on differential equations for the susceptible, 
infected and removed numbers. Different spread mechanisms are embedded into the various 
terms in the differential equations. 

In this paper we are interested in the number of hospitalized cases (cumulative number of 
cases minus the number of deaths and the number recovered) and attempt to consider a new 
approach to predict this number. In our approach all the mechanisms controlling the spread are 
factored into a single parameter. Assuming the system controlling the spread of SARS or similar 
epidemic is a thermodynamic one, we define an entropy and determine the only parameter by 
using the principle of extreme rate of entropy production. This allows us to relate the dynamics 
of the spread to the information at the inflexion point of the curve describing the time variation of 
the number of hospitalized cases. The inflexion point is the date at which the multiple controlling 
measures take effect. 

The model presented in this paper is based on a simple differential equation with the spread 
rate forced to satisfy four constraints (section 2.1). The model is closed by the use of maximum or 
minimal rate of entropy production as the system for spread is assumed to be a thermodynamical 
one (section 2.2). There is a critical point (date) at which the spread rate turns to decrease due 
to the overall role of interventions. The maximum number of infected individuals and the time at 
which this maximum occurs can be related to the number and time corresponding to the critical 
date (section 2.3). This model is validated against the SARS data of the year 2003 (section 3) 
for which we are able to follow the history of the spread. 

2 Model development 

2.1 Basic model 

Let f(t) be the number of hospitalized cases, defined as the cumulative number subtracted from 
the cumulative number of deaths and recovery ones since death and recovery are also parts of 



actions in the thermodynamical system. Then the rate of increase (decrease) is proportional to 
the number at the previous day, 

with the roles of all the controlling mechanisms factored into the parameter a(i). Knowing the 
exact expression of a(t) requires the knowledge of all the details of the epidemic and the coupling 
with other differential equations. The essential idea in our method to find a(t) is to ignore any 
details but to use a thermodynamic approach. It is to be remarked that this parameter must be 
subjected to the following four constraints: 

1) The parameter a(t) must have the dimension of t -1 , i.e. a(t) ~ t _1 . 

2) At the initial stage there is an exponential increase for regular spread to start (since at 
the initial stage the number is near zero), i.e., a(0) — > oo. 

3) With the strong and active interventions the rate must decrease at a given day t = L 

which will be called the inflexion point (date). Mathematically this amounts to say that J t z 
vanishes at t = L, i.e., 

da 9 

— +a 2 =0,t = L 

at 

4) There must be a maximum for /(£), say at t = D, for which we have a(t) = 0. 

We assume that the virus causing an epidemic is constantly active (high temperature or 
intrinsic lifetime constraint would make the epidemic disappear suddenly, but this is not consid- 
ered here) so that a(t) is assumed to be an analytical function. Also nature would select laws 
as simple as possible. The only analytical function that meets the four constraints and that is 
simple enough is found to be given by 

-c\n(t/D) 
a(t) = (2.2) 

where c is a parameter. Inserting (|2.2j) into (|2.1[) leads to the following solution which is just the 
log-normal function, 

^-TJEi-K-Hs^) (2 - 3) 

Here k is a proportion constant, /i = In D + er 2 , and er is to be determined in the following through 
the use of the principle of extreme rate of entropy production. 

2.2 Principle of extreme rate of entropy production 

The principle of extreme rate of entropy production can be found in [27) . This has been suc- 
cessfully used to obtain the distribution of droplet production during its impingement on solid 
walls [3T]. Certainly, the width of the curve f(t)<xt can be characterized by a. The wider the 
curve is, the larger is the (Shannon) entropy. The intrinsic spread mechanism of virus and the 
large mixing activity of the population tend to make the curve wider (so a larger). However, 
the medical and social interventions to control the epidemic constitute a dissipation mechanism 
which would prohibit the curve to become infinitely wide (er infinitely large). The width would 



cease to increase when the maximum dissipation rate is reached. Maximum dissipation rate 
corresponds to extreme rate of entropy production, which again corresponds to 



d 2 S(a, V ) 
da 2 







(2.4) 



Here S(a i rj) is the Shannon entropy defined as 

p oo 

S(a,r}) = - / F(t)laF(t)dt r ' 
Jo 

where F(t) = t 1 ^^ f(t) with 77 = 3 in the usual entropy definition. Integration leads to 

S(a, rj) = 77 ( In ( V^ira) + rj (in D + a 2 ) + 

so that (|2.4[) holds if and only if 

1 

0.408 for r] = 3 
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2.3 Maximum number of hospitalized cases 



(2.5) 



Inserting (|2.3p into 



d 2 f(t) 
dt 2 



t=L 



yields the following relationship between D and L 



D = Lexp(-a 2 + -^c 



Using (E3D, /(£>) is related to f(L) by 



f(D) = /(L)cxp V - l -^Aa 2 +a±+ l - Qa + ^4 + ct 2 



With er given by (|2.5p , we have 



D 

T 



= 1649^1 
,=3 '/(£) 



= 2.120 



(2.6) 



77=3 



2.4 Initial date for regular spreading 



Once we know the inflexion date, it is crucial to determine when is the initial date for regular 
spreading of the epidemic. In other words, we must know the number L (cumulated days to 
reach the inflexion point counting from the initial date). This can be done by using the rate of 
increase -^j^- at t — L. A simple calculation using (I2.3[) yields 



df(t) 



which yields 



L 



dt 
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t=L 
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L 
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f(L) 
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df(t) 
dt 



t=L 



df(t) 
dt 



(2.7) 



t=L 




Figure 1: Inflexion point for Beijing. Note 
that the first peak is not an inflexion point 
but is simply due to report delay in the 
early period. 




Figure 2: Time history of the number of 
hospitalized cases for Beijing. 



3 Application and validation of the model 

3.1 Use of the model 

The model is used as follows. 

Step 1 (data recording) . Using the reported data we determine the number F at the inflexion 

date (the date that -^^ tends to decrease). Also determine -^- by using the reported date. 

Determine the proportion constant k in (12.31) by setting Jl = f(L). Then use (|2.7[) to determine 
L. 

Step 2 (Prediction). Once L and f(L) are known, use (J2.6I) to predict D and f(D) and plot 
the curve f(t) ~ t using eq (|2.3I) to predict the number f(t) for L < t. 

Hence it is essential to determine the inflexion point. Specifically, this is done as follows. We 
record the reported number f(t) for each day and draw the curve g(t) = g(t) — g(t — 1). Once we 
observe that g(t) reaches a peak (denoted as G) at t = L, then L is considered as the inflexion 
point. However, special cautions must be made. 

(a) in the early period of the epidemic, it is possible to have report delay of cases so that a 
false peak would occur. 

(b) for a city or region where the cumulative number of cases remains always small, it is 
difficult to observe a clear peak. In this case this approach does not apply. 

(c) There is also a possibility to have multiple inflexion points due to new outbreaks, as is in 
the case of Hong Kong, Singapore and Canada. 

Numerically, L is calculated as 

L = 3F/G (3.1) 

where F — (2F — G)/2 is the number of / averaged over two consecutive dates (at and before 
the inflexion date). Still using the log- normal function, we can relate the maximum H = f(D) 
and the date D to F and G by H w 2.12F and D f=a L + 2F/G where none of the constants 
depends on the details of the epidemic. 




Figure 3: For Hong Kong we observe three 
distinct inflexion points. 
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Figure 4: The prediction using the informa- 
tion at the three inflexion points (IP1, IP2, 
IP3) shows that the predicted curve using 
the first inflexion point is the closest to the 
reported data. 



3.2 Test of the model for SARS in 2003 

First consider Beijing. Using the reported date as shown in Fig [TJ we identify April 27 to be 
the critical date since -MJ- experiences an evident decrease after that date (we also observe a 
decrease before April 25, but that decrease is due to the report delay). Using the reported date 

we have /(£) = 980 and -^-rr- = 116. Hence L = 24 according to (13. II) . This means that 

at t=L 

the initial date for irregular spread is April 3. Using (J2.6I) we predict D and f(D) to be D = 42 
(May 13) and f(D) = 1955, while according to the report, D = 44 (May 15) and /(£>) = 1991. 
The predicted curve / = /(£) follows well the curve, as can be seen in Fig. ??. 

For Hong Kong, we observe three distinct inflexion points as can be seen in Fig [3] The 
prediction using the information at the three inflexion points (IP1, IP2, IP3) show that the 
predicted curve using the first inflexion point is the closest to the reported data (Fig. [4j> . More 
details can be seen in Table 1. In Table 1, when there are multiple inflexion points, as is in the 
case for Hong Kong, Singapore and Canada, we use the information at the first inflexion point. 
In the case of Singapore and Canada, there are two maximums but we give information only for 
the first one. 

For Hebei, the number of cases is not large. But the prediction still works very well (Fig. [5j 
Fig. El). 

For Singapore we observe three distinct inflexion points and two maximums (Fig. [?])• The 
prediction using the information at the first inflexion point fits well to the most part of the first 
peak (Fig. [5J. For Canada we observe two inflexion points (Fig. \§§ and when the information 
of the first inflexion point is used the prediction reproduces well the lower part of the observed 
curve but fails to predict the peak value (Fig. [TO)) . 

In summary, when the number is small, the error is large, showing that the thermodynamic 
approach is more accurate when the system is larger. 



Table 1: Predicted number H and date D compared with the reported ones for several cities or 
regions (Mainland refers to Mainland China). 



Regions 


Inflexion date 


Date D 


Maximum H 


Error 
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Pred 
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Pred 
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Beijing 


Apr27 


980 


116 


May 13 


Mayl5 


1955 


1991 


2% 


HK 


Mar28 


960 


49 


Apr 12 


Aprl4 


823 


960 


14% 


World 


Apr23 


2005 


222 


May9 
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4015 


3700 


8.5% 


Mainland 


Apr27 


1572 
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Figure 5: For the Province of Hebei of 
China the inflexion point is still identifiable. 
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Figure 6: The predicted curve is reasonable 
as compared to the reported one, though 
the number of cases in Hebei is not large. 
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Figure 7: Inflexion points for Singapore. 




Figure 8: Comparion between predicted 
and reported cases for Singapore. 
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Figure 9: Inflexion points for Canada. 



Figure 10: Comparison between predicted 
and reported cases for Canada. 



Table 2: Best fit value of a for several regions using the least square method, compared to the 
theoretical value a = 0.408. The agreement is reasonable. 



Regions 


Starting date 


Date D 


Maximum H 


Best fit a 


Beijing 


Mar 25/(10 days later) 


May 15 


1991 


0.349/0.462 


Hong Kong 


Feb. 20/(10 days later) 


Apr. 14 


960 


0.285/0.343 


World 


Feb. 20/(10 days later) 


May 12 


3700 


0.273/0.32 


Mainland China 


Mar 25/(10 days later) 


May 12 


3068 


0.307/0.404 


Hebei 


Apr. 17/(10 days later) 


May 13 


161 


0.367/0.357 


Singapore 


March 1/(10 days later) 


Apr. 15 


64 


0.409/0.491 


Canada 


Feb. 25/(10 days later) 


April 8 


84 


0.297/0.368 



3.3 Comparison between the theoretical value of a and the reported 
value of a 

It is interesting to note that the best fit value of a using the reported data is close to the 
theoretical one (a = 0.408) (Table 2). In fitting <j, the date D (counting from the starting 
date) and the maximum value H are fixed to be the values given by the reported data (third and 
fourth columns) so that only a is fitted. In the second column, the starting date is approximately 
the date when the first case was introduced into the region. The outbreak for the epidemic is 
assumed to take place within at most ten days so the best fit a is obtained by using two epidemic 
starting days (date with the introduction of the first case and latest possible outbreak date) . The 
range of best fit a (fifth column) is very close to the theoretical value 0.408 for Beijing and is 
not significantly different from the theoretical value for the other cities or regions. 

One would wonder if the use of a value a far beyond the theoretical value does not alter 
the curve significantly. In order to see that, we display in Fig. [TT] the role of a on the correct 
reproduction of the curve / = f(t). The log-normal curves using the thermodynamical value 
a = 0.408 and the best fit value a — 0.47 are all close to the reported data. However, when a 
is significantly different from the thermodynamical value, then the log-normal curve has a great 
departure from the reported data, as can be seen from the curves using a = 0.1 and a = 0.9. 
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Figure 11: Role of a on the correct reproduction of the curve / = /(£). 



This shows that the shape of the curve is quite sensitive to a and the theoretical value of a is 
indeed a rational one. 



4 Concluding remarks 

We have built a closed model for which we just need some data for the early period to determine 
the inflexion point L, the number f(L) and the increase rate df(L)/dt. The model is applied to 
predict the number for t > L and especially D and f(D) for the 2003 SARS and is hoped to 
work when the system for SARS or similar epidemic spread involves multiscale interventions and 
constitutes a thermodynamical system. Despite the possible uncertainty in the reported data 
for < t < L and that the model does not require epidemic details such as latent, incubation 
and infectious periods, the comparison between model prediction and reported SARS data is still 
good enough for the cities or regions where the epidemic is severe. The prediction for the case 
of Beijing is remarkably well since the number of cases is very large. This shows that when the 
system is large enough, the thermodynamic approach is more accurate. The actual model has 
some difficulty to exactly handle the case of multiple inflexion points. 

The present model can be possibly used to predict epidemics other than SARS once the 
communicable epidemics receive intensive interventions. The H7N9 avian influenza is actually of 
great concern |28l 129) and if unluckily this should spread rapidly, we expect the present model 
would be useful for predicting its spread. 

For avian influenza, the observed severe symptom mainly includes high fever and pneumonia 
[2"81 12"9"] . It is interesting to note that, according to traditional Chinese meridian doctrine [501 
131] . giving a pressure down or performing an acupuncture on specific acupuncture points on 
the meridian in a correct way by experts could be helpful to relieve or cure the corresponding 
symptoms (see Table [3]). 



Table 3: The symptoms and the corresponding meridian acupuncture points (reproduced from 
reference |31j). Pain is felt on the meridian acupuncture points while a pressure is given on that 
points, if the corresponding symptom exists 

Symptom Acupuncture (meridian) Figure 



Figcrj 



high fever zhongzhu (TE3; SJ3), 

tianfu (L3; LU3), 

laogong (P8; PC8), 

zhongchong (P9; PC9), 

dazhu (Bll; BL11), 

pneumonia danzhong (CV17; RN17), Fig [TBI 

dazhu (Bll; BL11), 

feishu (B13; BL13), 

yiinmen (L2; LU2), 

yiiji (L10; LU10) 




zhongchong 




9 _ 



(d) 



4! 

42 



43 — 



r r 

I 1 dazhu ---..__ 

-i • 12 

13 



^\^uU 



/- 



Figure 12: Acupuncture points related to high fever: (a) zhongzhu (shaoyang sanjiao meridian of 
hand), (b) tianfu (taiyin lung meridian of hand), (c) laogong and zhongchong (jueyin pericardium 
meridian of hand), (d) dazhu (taiyang bladder meridian of foot). 



10 




Figure 13: Acupuncture points related to pneumonia: (a) danzhong (ren meridian), (b) dazhu 
and feishu (taiyang bladder meridian of foot), (c) yunmen and yiiji (taiyin lung meridian of 
hand). 
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